On the significance of the reference ratio method in inferential structure determination of biomolecules
Abstract
The inference of biomolecular structure from biophysical data is an important task in molecular biology. Rigorous Bayesian inference requires the formulation of a joint posterior distribution over structure and nuisance parameters given the observed data. The relationship between a biomolecular structure, typically represented as a vector of atomic coordinates, and the data is usually many-to-one. Consequently, forward models are needed to relate a given biomolecular structure to its associated data. The calculation of the likelihood thus involves projecting a high-dimensional manifold (the structure) onto a lower-dimensional one (the data). This projection is a reduced, or coarse-grained, representation of the structure. Given the nature of the data obtained from biophysical experiments, the use of prior distributions on biomolecular structure is indispensable. A prior on biomolecular structure necessarily also induces a prior on its coarse-grained representation. We call this induced prior the reference distribution. The reference distribution induced by a fine-grained prior is typically assumed to be suitable for the coarse-grained variable. Often, this assumption is invalid. Here, we quantify the impact of the induced reference distribution on the posterior distribution and discuss its possible implications.

1 Background

Biomolecular function is closely connected to structure. Consequently, the inference of structure from biophysical experiments is an important problem in molecular biology. However, the procedure is complicated by the nature of the experimental data, which are incomplete, averaged and subject to experimental noise. This makes it difficult, if not impossible, to determine structures from experimental observations without strong prior information. The determination of biomolecular structure usually concerns an atomic-level, or fine-grained, representation x ∈ N.
The experimental observations, d, provide information about some projection f of x. The relationship between f and x is given by a forward model, f = F(x, θ) ∈ M, where F : N × O → M, dim(N) ≫ dim(M), and θ ∈ O are nuisance parameters. That is, the fine-grained representation x and the coarse-grained representation f are related deterministically through F. In a practical structure inference setting, prior information on the fine-grained space N is typically introduced to compensate for the noisy and incomplete nature of the experimental data. Because of this deterministic relationship, a prior on N inadvertently induces a prior distribution on the coarse-grained space M. While the prior information about N may seem entirely appropriate, the same is not guaranteed for the induced prior on M. Moreover, if the induced prior is inappropriate, other free parameters, such as θ ∈ O, may compensate for it. This in turn introduces a bias that may hamper the direct interpretation of θ. Here, the aim is to demonstrate that the prior information used needs to be appropriate with respect to both N and M.
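The effect of an induced prior can be illustrated with a toy example. The sketch below is not taken from the paper: it assumes a hypothetical three-dimensional fine-grained variable x with independent standard normal priors and a hypothetical forward model F(x) = |x| with no nuisance parameters. Under these assumptions the induced reference distribution on f is a chi distribution with three degrees of freedom, which may differ markedly from what one would want to assume about f. The last step assumes that the reference ratio correction named in the title takes the form of reweighting by the ratio of a desired density on f (here an arbitrary illustrative normal) to the induced reference density.

```python
import math
import random


def forward_model(x):
    """Hypothetical forward model F: maps a 3-vector to its Euclidean norm."""
    return math.sqrt(sum(c * c for c in x))


def sample_fine_grained_prior(rng):
    """Assumed fine-grained prior on x: three independent standard normals."""
    return [rng.gauss(0.0, 1.0) for _ in range(3)]


def reference_density(f):
    """Induced prior on f = |x|: chi density with 3 degrees of freedom."""
    return math.sqrt(2.0 / math.pi) * f * f * math.exp(-0.5 * f * f)


def target_density(f, mu=2.0, sigma=0.3):
    """Assumed desired prior on f (illustrative choice, not from the paper)."""
    z = (f - mu) / sigma
    return math.exp(-0.5 * z * z) / (sigma * math.sqrt(2.0 * math.pi))


rng = random.Random(0)

# Push samples from the fine-grained prior through the forward model.
samples = [forward_model(sample_fine_grained_prior(rng)) for _ in range(20000)]
induced_mean = sum(samples) / len(samples)

# Reweight each sample by target(f) / reference(f) so that the
# coarse-grained variable follows the desired distribution instead.
weights = [target_density(f) / reference_density(f) for f in samples]
corrected_mean = sum(w * f for w, f in zip(weights, samples)) / sum(weights)

print(f"mean of f under the induced prior:  {induced_mean:.3f}")
print(f"mean of f after ratio reweighting: {corrected_mean:.3f}")
```

In this toy setting the fine-grained prior looks innocuous, yet it places the bulk of its mass on f away from the assumed target; dividing out the reference density is what removes that unintended contribution.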
Publication date: 2013